Agent Mode for “Serious” Developers: Autonomous Coding with VS Code

Session Date: May 21, 2025
Duration: 15 minutes
Venue: Build 2025 Conference - DEM519
Speaker: Devin Valenciano (Senior Product Manager, VS Code Team, Microsoft)
Link: Microsoft Build 2025 Session DEM519

Introduction and Session Overview
Real-World Challenge Setup
Agent Mode Live Demonstration
VS Code Platform Advantages
Technical Deep Dive: Data Science Pipeline
Performance Results and Metrics
Limitations and Considerations
Educational Impact and Learning Acceleration
- Knowledge Transfer Mechanisms
- Skill Development Opportunities
References

Introduction and Session Overview

Session Opening

Timeframe: 00:00:00 - 00:00:29
Duration: 29s
Speaker: Devin Valenciano

Devin Valenciano introduces himself as an Associate Product Manager (APM) on the VS Code team, setting the stage for a demonstration that challenges traditional perceptions about who can be a “serious developer.” The session’s core premise revolves around democratizing advanced development capabilities through Agent Mode.

“My name is Devin Valenciano… I am APM on the VS Code team. And like Leslie just said, I’m here to talk about Agent mode for serious developers.”

Agent Mode Concept

Timeframe: 00:00:29 - 00:01:09
Duration: 40s
Speaker: Devin Valenciano

The fundamental introduction to Agent Mode establishes it as more than traditional AI assistance—it’s positioned as an autonomous coding partner capable of iterative problem-solving and independent progress.

“Agent mode’s this really cool thing that’s built to the copilot that allows you to solve bigger problems. It’s this autonomous coding assistant that can iterate on its own progress.”

Key Capabilities Introduced:

Autonomous iteration - Independent problem-solving progression
Complex problem handling - Multi-step, interconnected challenges
Role democratization - Enabling non-experts to achieve expert-level results

Democratization of Development

Timeframe: 00:00:21 - 00:00:29
Duration: 8s
Speaker: Devin Valenciano

A crucial philosophical statement that frames the entire demonstration: Agent Mode as an equalizing force in technical development, allowing product managers and non-technical professionals to engage with complex programming challenges.

“One of the coolest things it does is that it lets APM like me pretend to be a serious developer for a little bit or a serious data scientist as we’re about to show.”

Real-World Challenge Setup

Kaggle Competition Introduction

Timeframe: 00:00:29 - 00:00:47
Duration: 18s
Speaker: Devin Valenciano

The session establishes credibility by selecting a genuine competitive programming challenge rather than a contrived demo, positioning Kaggle as the data science equivalent of LeetCode for algorithmic challenges.

“So the example that I’m going to run you all through is this real piece of work. So this is what’s called a Cagle competition. It’s the data science equivalent of Leap code.”

Problem Specification

Timeframe: 00:00:47 - 00:01:05
Duration: 18s
Speaker: Devin Valenciano

The housing price estimation challenge represents a complex machine learning problem with real-world applicability, requiring sophisticated data analysis, feature engineering, and model selection techniques.

Problem Components:

Training dataset - Historical housing data with price labels
Test dataset - Houses requiring price prediction
Sample submission - Required output format
Data description - Feature documentation and context

Competition Context

Timeframe: 00:00:38 - 00:00:47
Duration: 9s
Speaker: Devin Valenciano

The competitive environment provides authentic constraints and evaluation metrics, ensuring the demonstration reflects real-world development pressures and performance standards.

“And they put full competitions online for people to put submissions up and and try and solve difficult problems in data science.”

Agent Mode Live Demonstration

Initial Setup and Context

Timeframe: 00:01:06 - 00:01:41
Duration: 35s
Speaker: Devin Valenciano

The demonstration begins with minimal human preparation—only a prompt file and competition data files—emphasizing Agent Mode’s ability to work from minimal starting context.

“So you can see I’ve got a couple things set up here and that’s it. I’ve got a prompt that I’m going to use to prompt agent mode. And I’ve got the the files that I just showed you pre downloaded.”

Starting Materials:

Custom prompt file for Agent Mode guidance
Competition data files (train.csv, test.csv, data_description.txt, sample_submission.csv)
VS Code with Copilot Agent Mode enabled

Autonomous Problem Analysis

Timeframe: 00:01:43 - 00:02:30
Duration: 47s
Speaker: Devin Valenciano

Agent Mode immediately demonstrates human-like problem-solving behavior by systematically analyzing the competition requirements and data structure without explicit instruction.

“And it does something great right away. It says I’m going to go fetch this URL that has the actual competition data and it’s using the built in fetch tool to go ahead and do that.”

Autonomous Analysis Steps:

URL Fetching - Automatic retrieval of competition description
File Reading - Systematic examination of data files
Data Sampling - Preview of dataset structure (first 10 lines)
Context Building - Understanding problem requirements and constraints

Technical Implementation Workflow

Timeframe: 00:02:08 - 00:02:45
Duration: 37s
Speaker: Devin Valenciano

The human-like analytical approach showcases Agent Mode’s sophisticated reasoning patterns, mirroring how experienced data scientists approach new problems.

“It does exactly what a human would do, right? It goes and reads the description file and reads the whole thing. And it’s like, OK, I understand a little bit about what this data is looking like.”

VS Code Platform Advantages

Native Notebook Integration

Timeframe: 00:02:45 - 00:03:15
Duration: 30s
Speaker: Devin Valenciano

VS Code’s competitive advantage emerges through seamless integration between Agent Mode and Jupyter notebooks, creating a unified development environment for data science workflows.

“So for those who don’t know, VS Code has some really incredible built in native notebook support. And this is like a primary tool among data scientists in the field.”

Integration Benefits:

Automatic notebook creation - Agent Mode generates appropriate project structure
Kernel management - Intelligent Python environment selection
Cell execution control - User permission system for code safety
Professional workflows - Industry-standard data science practices

Model Selection Capabilities

Timeframe: 00:03:22 - 00:03:47
Duration: 25s
Speaker: Devin Valenciano

The demonstration highlights VS Code Copilot’s multi-model support, allowing users to select optimal AI models for specific problem types and personal preferences.

“So right now we’re using the clod 3.7 sonnet model. I love clod 3.7 sonnet. It’s my favorite of all the ones listed here, but we have plenty to choose from within Copilot.”

Available Models:

Claude 3.5 Sonnet - Advanced reasoning capabilities
Claude 3.7 Sonnet - Preferred for deep analysis (demonstrated)
GPT-4 Turbo - Microsoft partnership integration
GPT-4.1-O - Latest OpenAI model
Gemini - Google model (preview availability)

Built-in Tool Ecosystem

Timeframe: 00:04:25 - 00:04:49
Duration: 24s
Speaker: Devin Valenciano

The prompt file system represents a significant productivity enhancement, enabling reusable development patterns and knowledge capture across projects.

“So this allows you to save custom prompts that can be used across multiple projects that can be tied to AVS code profile.”

Tool Ecosystem Features:

Prompt file storage - Reusable instruction patterns
Profile integration - Personalized development workflows
Cross-project application - Consistent approaches across domains
Efficiency optimization - Elimination of repetitive setup tasks

Technical Deep Dive: Data Science Pipeline

Environment Setup and Package Management

Timeframe: 00:07:22 - 00:08:00
Duration: 38s
Speaker: Devin Valenciano

Agent Mode demonstrates sophisticated dependency management by autonomously selecting and installing appropriate packages for the machine learning challenge without explicit guidance.

“So this just installs all the required packages. I didn’t tell it which packages to go install. It decided based on the problem set to which packages would be most useful in solving the Cagle competition.”

Automatically Selected Packages:

Data Manipulation: pandas, numpy
Visualization: matplotlib, seaborn
Machine Learning: scikit-learn (multiple modules)
Advanced ML: XGBoost
Statistical Analysis: scipy

Exploratory Data Analysis

Timeframe: 00:08:59 - 00:10:05
Duration: 1m 6s
Speaker: Devin Valenciano

The systematic data exploration phase reveals Agent Mode’s understanding of data science best practices, including data quality assessment and distribution analysis.

“And so next it jumps right into the exploratory data analysis. So it, it does what any human would do and it’s, it reads in the data and it starts to take a look at what the data actually is and what it can be used for.”

Analysis Components:

Data Loading - Systematic file reading and structure examination
Missing Value Detection - Quality assessment and data completeness
Distribution Analysis - Statistical property examination
Variable Correlation - Feature relationship identification

Data Preprocessing and Feature Engineering

Timeframe: 00:11:51 - 00:12:04
Duration: 13s
Speaker: Devin Valenciano

The preprocessing phase showcases advanced data science techniques, including log transformation for data normalization and categorical variable encoding.

“So it’s handling missing values, it’s transforming categorical variables, so variables that are, you know, in a string format into something that’s more usable for these machine learning models.”

86-Line Data Cleansing Implementation:

Missing value imputation - Strategic handling of incomplete data
Categorical encoding - String-to-numerical conversion
Feature scaling - Normalization for model optimization
Data validation - Quality assurance across datasets

Model Training and Evaluation

Timeframe: 00:12:26 - 00:13:20
Duration: 54s
Speaker: Devin Valenciano

The multi-algorithm approach demonstrates sophisticated machine learning strategy, training seven different models with systematic performance evaluation.

Basic Model Suite:

Linear Regression - Statistical baseline approach
Ridge Regression - Regularized linear model
Lasso Regression - Feature selection through regularization
Elastic Net - Combined Ridge/Lasso approach

Advanced Model Suite:

Random Forest - Ensemble tree-based method
Gradient Boosting - Sequential improvement algorithm
XGBoost - Optimized gradient boosting framework

Advanced Ensemble Methods

Timeframe: 00:13:37 - 00:14:01
Duration: 24s
Speaker: Devin Valenciano

The ensemble creation represents professional-grade machine learning technique, combining multiple model predictions for improved accuracy and robustness.

“But it actually does something even more advanced where it takes the the, the seven models that were generated and all those sets and it creates what’s called an ensemble.”

Ensemble Benefits:

Improved accuracy - Leveraging strengths of multiple approaches
Reduced overfitting - Balancing individual model limitations
Robust predictions - Consistent performance across data variations
Competition-grade technique - Industry-standard advanced methodology

Performance Results and Metrics

Competition Results

Timeframe: 00:14:41 - 00:15:01
Duration: 20s
Speaker: Devin Valenciano

The quantified success metrics provide concrete evidence of Agent Mode’s effectiveness, achieving professional-level results in a competitive environment.

“And so I got a score of 14,000 on this, which equates to about 300th place out of 6000. And submissions… So top 5% generated in about 10 minutes time with very little, you know, serious development on my end.”

Final Performance Metrics:

RMSE Score: 14,000
Competition Ranking: 300th out of 6,000 submissions
Percentile Performance: Top 5%
Development Time: Approximately 10 minutes
Human Intervention: Single prompt + execution permissions

Development Time Analysis

Timeframe: 00:07:35 - 00:07:43
Duration: 8s
Speaker: Devin Valenciano

The time efficiency comparison illustrates the dramatic acceleration in development workflows, from hours or days to minutes for complete data science pipelines.

“Even if I don’t know exactly what library does which thing I can go then research. It gives me a starting point to then go learn more efficiently.”

Efficiency Gains:

Traditional Approach: Hours to days for complete solution
Agent Mode Approach: 10 minutes for competition-ready submission
Setup Elimination: Automatic package selection and environment configuration
Learning Acceleration: Immediate exposure to expert-level techniques

Learning Efficiency Gains

Timeframe: 00:02:30 - 00:02:39
Duration: 9s
Speaker: Devin Valenciano

The educational value emerges through observation of professional development patterns, enabling rapid skill acquisition through practical application.

“It’s just saving me a ton of time and, and actually probably preventing me from learning a little bit, but it’s a pretty learning tool to, to, to work alongside.”

Limitations and Considerations

Non-Deterministic Behavior

Timeframe: 00:04:49 - 00:05:02
Duration: 13s
Speaker: Devin Valenciano

The session candidly addresses Agent Mode’s limitations, including unpredictable execution patterns that require human oversight and intervention.

“And unfortunately with agentic development flow, we have these little moments where it’s like, ah, shoot, I know this should work a certain way, but it’s not quite deterministic.”

Practical Challenges:

Unpredictable pausing - Agent may require continuation prompts
Execution variability - Non-deterministic workflow progression
Human supervision - Need for guidance during complex operations
Backup strategies - Importance of fallback plans for demonstrations

Ethical Usage Guidelines

Timeframe: 00:14:34 - 00:14:41
Duration: 7s
Speaker: Devin Valenciano

Clear ethical boundaries establish appropriate use cases for Agent Mode, emphasizing learning and development over competitive advantage in existing systems.

“And again, I don’t recommend that people are, you know, submitting Cagle submissions. And this isn’t the right way to for this to be done, but it’s a really quick way to learn.”

Best Practices

Timeframe: 00:10:05 - 00:10:12
Duration: 7s
Speaker: Devin Valenciano

The demonstration emphasizes the importance of user control and permission systems to prevent unauthorized or malicious code execution.

“We don’t want an agentic mode to just keep spinning and spinning and and eventually spending compute that it doesn’t need to compute.”

Recommended Approaches:

Learning and education - Primary use case for skill development
Prototyping and exploration - Rapid proof-of-concept creation
Starting point generation - Foundation code for human refinement
Pattern learning - Understanding professional development workflows

Educational Impact and Learning Acceleration

Knowledge Transfer Mechanisms

Timeframe: 00:11:04 - 00:11:08
Duration: 4s
Speaker: Devin Valenciano

Agent Mode serves as an educational accelerator by exposing users to advanced techniques and best practices they might not discover independently.

“But with agent mode, I can start to learn about these topics a lot faster.”

Learning Benefits:

Pattern Recognition - Observing expert-level problem-solving approaches
Technique Exposure - Introduction to advanced methodologies
Best Practice Integration - Learning industry-standard workflows
Rapid Iteration - Quick experimentation with different approaches

Skill Development Opportunities

Timeframe: 00:10:51 - 00:11:04
Duration: 13s
Speaker: Devin Valenciano

The demonstration highlights how Agent Mode enables exploration of advanced concepts that would typically require extensive domain expertise to discover and implement.

“And so it’s, it’s looking at all these different facets of the data that I personally would never think to explore because I’m not a professional data scientist.”

Educational Applications:

Data Science Methodology - Complete analytical workflow understanding
Feature Engineering - Variable transformation and selection techniques
Model Selection - Algorithm comparison and evaluation methods
Performance Optimization - Advanced techniques like ensemble methods

Appendix: Technical Implementation Details

Prompt File Configuration

The session references a custom prompt file system that enables reusable development patterns. While the specific prompt content isn’t detailed in the transcript, the system allows for:

Consistent Instructions - Standardized approach across projects
Profile Integration - Personalized development workflows
Cross-Project Reuse - Efficient knowledge transfer between domains

Model Performance Comparison

The automatic model selection and comparison process included systematic evaluation of seven different algorithms:

Traditional Statistical Methods:

Linear Regression (baseline)
Ridge Regression (L2 regularization)
Lasso Regression (L1 regularization)
Elastic Net (combined regularization)

Advanced Machine Learning:

Random Forest (ensemble tree-based)
Gradient Boosting (sequential optimization)
XGBoost (optimized gradient boosting)

Data Quality Assessment

The exploratory data analysis phase included comprehensive data quality evaluation:

Missing Value Analysis - Systematic identification of incomplete data
Distribution Examination - Statistical property assessment and visualization
Correlation Matrix - Feature relationship analysis for variable selection
Transformation Requirements - Log transformation for data normalization

References

Agent Mode and VS Code Documentation

VS Code Copilot Documentation - Comprehensive guide to Copilot features including Agent Mode capabilities, setup instructions, and best practices for autonomous coding assistance. Essential for understanding the technical foundation of the demonstrated capabilities.
Jupyter Notebook Support in VS Code - Detailed documentation on VS Code’s native notebook integration, which provides the foundation for Agent Mode’s data science workflow capabilities demonstrated in the session.

Data Science and Machine Learning Resources

Kaggle Learn - Free micro-courses in data science and machine learning that provide foundational knowledge for understanding the techniques Agent Mode applied in the housing price prediction challenge.
Scikit-learn Documentation - Comprehensive documentation for the machine learning library that Agent Mode automatically selected and utilized for model training and evaluation in the demonstration.
XGBoost Documentation - Technical documentation for the advanced gradient boosting library that Agent Mode autonomously incorporated into the ensemble model approach.

Competitive Programming and Data Science

Kaggle Housing Prices Competition - The actual competition referenced in the demonstration, providing context for the problem complexity and competitive benchmarks achieved by Agent Mode.
Machine Learning Ensemble Methods - Technical explanation of ensemble methods like the one Agent Mode automatically implemented, relevant for understanding the sophisticated approach demonstrated.

AI Model Integration

Claude Model Documentation - Understanding Claude 3.7 Sonnet capabilities and optimal usage patterns, which was the preferred model used in the demonstration for its deep analysis capabilities.
GitHub Copilot Best Practices - Best practices for effective AI pair programming that complement the autonomous capabilities demonstrated in Agent Mode.

Microsoft Build 2025 Context

Microsoft Build 2025 Sessions - Complete session catalog from Build 2025 conference, providing broader context for the developer tools and AI capabilities being introduced across Microsoft’s ecosystem.
VS Code Team Updates - Regular updates on VS Code feature releases and improvements, including Agent Mode development and related AI-powered development tools.

This session demonstrates the transformative potential of Agent Mode in democratizing advanced development capabilities, enabling developers and non-developers alike to achieve professional-grade results through autonomous AI assistance. The real-world Kaggle competition success validates the practical applicability of agentic development tools in solving complex, multi-step problems with minimal human intervention.